Turning a Term Extractor into a new Domain: first Experiences
نویسندگان
چکیده
Computational terminology has notably evolved since the advent of computers. Regarding the extraction of terms in particular, a large number of resources has been developed: from very general tools to other much more specific acquis ition methodologies. Such acquisition methodologies range from using simple linguistic patterns or frequency counting methods to using much more evolved strategies combining morphological, syntactical, semantical and contextual information. Researchers usually develop a term extractor to be applied to a given domain and, in some cases, some testing about the tool performance is also done. Afterwards, such tools may also be applied to other domains, though frequently no additional test is made in such cases. Usually, the application of a given tool to other domain does not require any tuning. Recently, some tools using semantic resources have been developed. In such cases, either a domain-specific or a generic resource may be used. In the latter case, some tuning may be necessary in order to adapt the tool to a new domain. In this paper, we present the task started in order to adapt YATE, a term extractor that uses a generic resource as EWN and that is already developed for the medical domain, into the economic one.
منابع مشابه
Using Wikipedia for term extraction in the biomedical domain: first experiences
We present a term extractor that uses Wikipedia as an semantic information source. The system has been tested on a Spanish medical corpus. We compare the results obtained using a module of a hybrid term extractor and an equivalent module that use the Wikipedia. The results show that this resource may be used for this task.
متن کاملLog-Domain Circuits for Auditory Signal Processing
The theory and practice of log-domain filter design has reached the point where it is possible to incorporate log-domain filter structures into large current-mode VLSI systems. We report on interface circuits used to implement a current-mode frontend filterbank and feature extractor for acoustic pattern recognition. These circuits maintain a log-domain structure, acting on the unexpanded filter...
متن کاملP15: Hippocampus-Neocortical Communication in Learning
The hippocampus is located in the medial temporal lobe and is a part of the forebrain. It plays a critical role in formation of declared memories. The hippocampus is banana­-shaped and communicates with all parts of neocortex. Reptiles and birds have structures like hippocampus that potentially serve as navigation functions. During the mammalian evolution, the neocortex has a large expansio...
متن کاملExtracting terminology from Wikipedia
In this paper we present a new approach for obtaining the terminology of a given domain using the category and page structures of the Wikipedia in a domain and language independent way. The idea is to take profit of category graph of Wikipedia starting with a set of categories that we associate with the domain. After obtaining the full set of categories belonging to the selected domain, the col...
متن کاملLearning from Parenthetical Sentences for Term Translation in Machine Translation
Terms extensively exist in specific domains, and term translation plays a critical role in domain-specific machine translation (MT) tasks. However, it’s a challenging task to translate them correctly for the huge number of pre-existing terms and the endless new terms. To achieve better term translation quality, it is necessary to inject external term knowledge into the underlying MT system. For...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008